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This Application is a. ContinuationHn Part of U.S.Application Serial No. 
08/598,591 filed on February 12,^996. 

CODING SEQUENCES OF THE HUMAN BRCA1 GENE 

5 

FIELD OF THE TNVFNTTON 

This invention relates to a gene which has been associated with breast and 
ovarian cancer where the gene is found to be mutated. 'More specifically, this 
invention relates to the three coding sequences of the BRCA1 gene BRCAl( omi1 )' 
10 BRCAK°^>, and BRCAK°™3)) isolated from human subjects. 

gACKGRQUNP OF THE INVENTION 

It has been estimated that about 5-10% of breast cancer is inherited Rowell, 
S., et al, American Journal of Human Genetics 5£:861-865 (1994). Located on 

15 chromosome 17, BRCA1 is the first gene identified to be conferring increased risk 
for breast and ovarian cancer. Miki et al, ScienceJ£fi:66-7l (1994). Mutations in 
this "tumor suppressor" gene are thought to account for roughly 45% of 
inherited breast cancer and 80-90% of families with increased risk of early onset 
breast and ovarian cancer. Easton et al, American Journal of Human Genetics 

20 52:678-701(1993). 

Locating one or more mutations in the BRCA1 region of chromosome 17 
provides a promising approach to reducing the high incidence and mortality 
associated with breast and ovarian cancer through the early detection of women 
at high risk. These women, once identified, can be targeted for more aggressive 

25 prevention programs. Screening is carried out by a variety of methods which 
include karyotyping, probe binding and DNA sequencing. 

In DNA sequencing technology, genomic DNA is extracted from whole 
blood and the coding sequences of the BRCA1 gene are amplified. The coding 
sequences might be sequenced completely and the results are compared to the 

30:.. DNA sequence of the gene. Alternatively, the coding sequence of the sample 
gene may be compared to a panel of known mutations before completely 
sequencing the gene and comparing it to a normal sequence of the gene. 
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If a mutation in the BRCA1 coding sequence is found, it may be possible to 
provide the individual with increased expression of the gene through gene, 
transfer therapy. It has- been demonstrated that the gene transfer of the BRCA1 
coding sequence into cancer cells inhibits their growth and reduces 

5 tumorigenesis of human cancer cells in nude mice. Jeffrey Holt and his 
colleagues-conclude' that the product of BRCA1 expression is a secreted tumor 
growth inhibitor, making BRCA1 an ideal gene for gene therapy studies. 
Transduction of only a moderate percentage of tumor cells apparently produces 
enough growth inhibitor to inhibit all tumor cells. Arteaga, CL, and JT Holt 

10 Cancer Research 56: 1098-1103 (1996), Holt, JT et aL, Nature Genetics 12: 298-302 
(1996). 

The observation of Holt et al, that the BRCA1 growth inhibitor is a 
secreted protein leads to- the possible use of injection of the growth inhibitor, into 
the area of the tumor for tumor suppression. 
15 The BRCA1 gene is divided into 24 separate exons. Exons 1 and 4 are 

noncoding, in that they are not part of the final functional BRCA1 protein 
. product. The BRCA1 coding sequence spans roughly 5600 base pairs (bp). Each 
exon consists of 200-400 bp, except for exon 11 which contains about 3600 bp. To 
sequence the coding sequence of the BRCA1 gene, each exon is amplified 
separately and the resulting PGR products are sequenced in the forward and 
reverse directions. Because exon 11 is so large, we have divided it into twelve 
overlapping PCR fragments of roughly 350 bp each (segments "A" through "L" of 
BRCA1 exon 11). 

Many mutations and polymorphisms have already been reported in the 
25 BRCA1 gene. A world wide web site has been built to facilitate the detection and 
characterization of alterations in breast cancer susceptibility genes. Such 
mutations in BRCA1 can be accessed through the Breast Cancer Information Core 
at: http://www.nchgr.nih.gov/dir/lab_transfer/bic. This data site became 
publicly available on November 1, 1995. Friend, S. et aL Nature Genetics 11:238, 
30 (1995). 
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The genetics of Breast/Ovarian Cancer Syndrome is autosomal dominant 
with reduced penetrance. In simple terms/this means that the syndrome runs 
through families such that both sexes can be carriers (only women get the disease 
but men can pass it on), all generations will likely have breast/ovarian or both 
5 diseases and sometimes in the same individual, occasionally women carriers 
either die young before they have the time to manifest disease (and yet offspring 
get it) or they never develop breast or ovarian cancer and die of old age (the latter 
people are said to have "reduced penetrance" because they never develop cancer). 
Pedigree analysis and genetic counseling is absolutely essential to the proper 

0 workup of a family prior to any lab work. 

Until now, only a single coding sequence for the BRCA1 gene has been 
available for comparison to patient samples. That sequence is available as 
GenBank Accession Number U14680. There is a need in the art, therefore, to 
have available a coding sequence which is the BRCA1 coding sequence found in 

15 the majority of the population, a "consensus coding sequence", BRCAK°n>n) s e q. 
ID. NO. 1. A consensus coding sequence will make it possible for true mutations 
to be easily identified or differentiated from polymorphisms. Identification of 
mutations of the BRCA1 gene and protein would allow more widespread 
diagnostic screening for hereditary breast and ovarian cancer than is currently 

20 possible. Two additional coding sequences have been isolated and characterize. 
The BRCAl(omi2) SEQ. ID. NO.: 3, and BRCAl«>mi3) SEQ. ID. NO.:5 codin* 
sequences also have utility in diagnosis, gene therapy and in making therapeutic 
BRCA1 protein. 

A coding sequence of the BRCA1 gene which occurs most commonly in 
25 the human gene pool is provided. The most commonly occurring coding 
sequence more accurately reflects the most likely sequence to be found in a 
subject. Use of the coding sequence BRCAK°™i) SEQ. ID. NO.: 1, rather than the 
previously published BRCA1 sequence, will reduce the likelihood of 
misinterpreting a . "sequence variation" found in the population (i.e. 
30 polymorphism) with a pathologic "mutation" (i.e. causes disease in the 
individual or puts the individual at a high risk of developing the disease). With 
large interest in breast cancer predisposition testing, misinterpretation is 
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particularly worrisome. People who already have breast cancer are asking the 
clinical question: "is my disease caused by'a heritable genetic mutation?" The 
relatives of the those with breast cancer are asking the question: "Am I also a 
carrier of the mutation my relative has? Thus, is my risk increased, and should I 
undergo a more aggressive surveillance program." 

SUMMARY OF THE INVPNITTOM 

The present invention is based on the isolation of three coding sequences 
of the BRCA1 gene found in human individuals. 

It. is an object of the invention to provide the most commonly occurring 
coding sequence of the BRCA1 gene. 

It is another object of this invention to provide two other coding 
sequences of BRCA1 gene. 

It is another object of the invention to provide three protein sequences 
15 coded for by three of the coding sequences of the BRCA1 gene. 

It is another object of the invention to provide a list of the codon pairs 
which occur at each of seven polymorphic points on the BRCA1 gene. 

It is another object of the invention to provide the rates of occurrence for 
the codons. 

20 It is another object of the invention to provide a method wherein BRCA1, 

or parts thereof, is amplified with one or more oligonucleotide primers. 

It is another object of this invention to provide a method of identifying 
individuals who carry no mutation(s) of the BRCA1 coding sequence and 
therefore have no increased genetic susceptibility to breast or ovarian cancer 
25 based on their BRCA1 genes. 

It is another object of this invention to provide a method of identifying a 
mutation leading to an increased genetic susceptibility to breast or ovarian 
cancer. 

There is a need in the art for a sequence of the BRCA1 gene and for the 
30 protein sequence of BRCA1 as well as for an accurate list of codons which occur at 
polymorphic points on a sequence. 
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A person skilled in the art of genetic susceptibility testing will find the present 
invention" useful for: 

a) identifying individuals having a BRCA1 gene with no coding 

mutations, who therefore cannot be said to have an increased 

5 genetic susceptibility to breast or ovarian cancer from their BRCA1 

genes; 

b) avoiding misinterpretation of polymorphisms found in the BRCA1 
gene; 

c) determining the presence of a previously unknown mutation in the 
10 BRCAlgene. 

d) identifying a mutation which increases the genetic susceptibility t o 
breast or ovarian cancer. 

e) probing a human sample of the BRCA1 gene. 

f) performing gene therapy. 

15 S) for making a functioning tumor growth inhibitor protein coded for 

by one of the BRCA1°»* genes. 

BRIEF DESCRIPTION of tttf nr.Tip^ 

As shown in FIGURE 1, the alternative alleles at polymorphic (non-mutation 
20 causing variations) sites along a chromosome can be represented as a "haplotype" 
within a gene such as BRCA1. The BRCAK°™i) haplotype is shown in Figure 1 
with dark shading (encompassing the alternative alleles found at nucleotide sites 
2201, 2430, 2731, 3232, 3667, 4427, and 4956). For comparison, the haplotype that is 
in GenBank is shown with no shading. As can be seen from the figure, the 
25 common "consensus" haplotype is found intact in five separate chromosomes 
labeled with the OMI symbol (numbers 1-5 from left to right). Two additional 
haplotypes (BRCAl( omi2 >, and BRCAl(°mi3) are represented with mixed dark and 
light shading (numbers 7 and 9 from left to right). In total, 7 of 10 haplotypes 
along the BRCA1 gene are unique. 

30 
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DETAILED D ESCRIPTION OF THE INVENTION 



DEFINITIONS 

The following definitions are provided for the purpose of understanding this 
5 invention. 

"Breast and Ovarian cancer" is understood by those skilled in the art to 
include breast and ovarian cancer in women and also breast and prostate cancer 
in men. BRCA1 is associated genetic susceptibility to inherited breast and 
10 ovarian cancer in women and also breast and prostate cancer in men. Therefore, 
claims in this document which recite breast and /or ovarian cancer refer to breast, 
ovarian and prostate cancers in men and women. 

" Coding sequence'' or " DNA coding sequence" refers to those portions of a 
15 gene which, taken together, code for a peptide (protein), or which nucleic acid 
itself has function. 

Protein" or "peptide" refers to a sequence amino acids which has 
function. 

20 "BRCAl(° mi )" refers collectively to the "BRCAl< omi1 )", "BRCAK 0 ^)" and 

"BRCAK 01 **)" coding sequences. 

"BRCAK omn >" refers to SEQ. ID. NO.: 1, a coding sequence for the BRCA1 
gene. The coding sequence was found by end to end sequencing of BRCA1 alleles 
25 from individuals randomly drawn from a Caucasian population found to have 
no family history of breast or ovarian cancer. The sequenced gene was found not 
to contain any mutations. BRCAK omil > was determined to be a consensus 
sequence by calculating the frequency with which the coding sequence occurred 
among, the sample alleles sequenced. 

30 

"BRCAl(o^)" and."BRCAl( om *)" refer to SEQ. ID. NO.: 3, and SEQ. ID. NO.: 
5 respectively. They are two additional coding sequences for the BRCA1 gene 
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which were also isolated from individuals randomly drawn from a Caucasian 
population found to have no family history of breast or ovarian cancer, 
polymorphisms 

"Primer" as used herein refers to a sequence comprising about 20 or more 
nucleotides of the BRCA1 gene. 

"Genetic susceptibility" refers to the susceptibility to breast or ovarian 
cancer due to the presence of a mutation in the BRCA1 gene. 

A "target polynucleotide" refers to the nucleic acid sequence of interest e.g., 
the BRCA1 encoding polynucleotide. Other primers which can be used for 
primer hybridization will be known or readily ascertainable to those of skill in 
the art. 

"Consensus" means the most commonly occurring in the population. 
"Consensus genomic sequence" means the allele of the target gene which 
occurs with the greatest frequency in a population of individuals having no 
family history of disease associated with the target gene. 



"Substantially complementary to" refers to a probe or primer sequences 
which hybridize to the sequences provided under stringent conditions and /or 
sequences having sufficient homology with BRCA1 sequences, such that the 
allele specific oligonucleotide probe or primers hybridize to the BRCA1 sequences 
25 to which they are complimentary. 

"Haplotype" refers to a series of alleles within a gene on a chromosome. 

"Isolated" as used herein refers to substantially free of other nucleic acids, 
30 proteins, lipids, carbohydrates or other materials with which they may be 
associated. Such association is typically either in cellular material or in a 
synthesis medium. 
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"Mutation" refers to a base change or a gain or loss of base pair(s) in a DNA 
sequence, which results in a DNA sequence which codes for a non-functioning 
protein or a protein with substantially reduced or altered function. 

"Polymorphism" refers to a base change which is not associated with 
known pathology. 

"Tumor growth inhibitor protein" refers to the protein coded for by the 
BRCA1 gene. The functional protein is thought to suppress breast and ovarian 
tumor growth. 

The invention in several of its embodiments includes: 

1. An isolated consensus DNA sequence of the BRCA1 coding sequence as set 
forth in SEQ. ID. NO.: 1. 

2. A consensus protein sequence of the BRCA1 protein as set forth in 
SEQ. ID. NO.: 2. 

3. An isolated coding sequence of the BRCA1 gene as set forth in 
20 SEQ. ID. NO.: 3. 

4. A protein sequence of the BRCA1 protein as set forth in 
SEQ. ID. NO.: 4 . 

25 5. An isolated coding sequence of the BRCA1 gene as set forth in 
SEQ. ID. NO.: 5. 

6. A protein sequence of the BRCA1 protein as set forth in SEQ. ID. NO.: 6. 

30 

7. A BRCA1 gene with a BRCA1 coding sequence not associated with 
breast or ovarian cancer which comprises an, alternative pair of codons, AGC 

8 
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and AGT, which occur at position 2201 at frequencies of about 35-45%,- and 
from about 55-65%, respectively. 

8. A BRCA1 gene according to Claim 7 wherein AGC occurs at a 
5 frequency of about 40%. 

9. A set of at least two alternative codon pairs which occur at 
polymorphic positions in a BRCA1 gene with a BRCA1 coding sequence not 
associated with breast or ovarian cancer, wherein codon pairs are selected 

10 from the group consisting of: 

• AGC and AGT at position 220 1 : 

• TTG and CTG at position 2430; 
CCG and CTG at position 2731; 

• GAA and GGA at position 3232: 
15 • AAA and AGA at position 3667; 

TCT and TCC at position 4427; and 

• AGT and GGT at position 4956. 

10. A set of at least two alternative codon pairs according to claim 9, 
20 wherein the codon pairs occur in the following frequencies, respectively, in a 
population of individuals free of disease: 

• at position 2201 , AGC and AGT occur at frequencies from about 
35-45%, and from about 55-65%, respectively; 

at position 2430. TTG and CTG occur at frequencies from about 
25 35-45%, and from about 55-65%, respectively; 

• at position 273 1 , CCG and CTG occur at frequencies from about 
25-35%, and from about 65-75%, respectively; 

• at position 3232, GAA and GGA occur at frequencies from about 
35-45%, and from about 55-65%, respectively; 

30 • at position 3667, AAA and AGA occur at frequencies from about 

35-45%, and from about 55-65%. respectively; 

• at position 4427, TCT and TCC occur at frequencies from about 
45-55%, and from about 45-55%, respectively: and 

• at position 4956, AGT and GGT occur at frequencies from about 
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35-45%, and from about 55-65%, respectively. 



11 A set according to Claim 1Q which is at least three codon pairs. 



5 12 A set according to Claim 10 which is at least four codon pairs. 



13. A set according to Claim 10 which is at least five codon pairs. 
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14. A set according to Claim 10 which is at least six codon pairs. 



15 A set according to Claim 10 which is at least seven codon pairs. 



16. A method of identifying individuals having a BRCA1 gene with a 
BRCA1 coding sequence not associated with disease, comprising: 
5 (a) amplifying a DNA fragment of an individual's BRCA1 

coding sequence using an oligonucleotide primer which 
specifically hybridizes to sequences within the gene; 
(b) sequencing said amplified DNA fragment by dideoxy 
sequencing; 

50 (c) repeating steps (a) and (b) until said individual's BRCA1 

coding sequence is completely sequenced; 
(d) comparing the sequence of said amplified DNA fragment 
to a BRCAl(° mi > DNA sequence, SEQ. ID. NOl, SEQ. ID. 
N03, or SEQ. ID. N05; 
25 (e) determining the presence or absence of each of the 

following polymorphic .variation in said individual's 
BRCA1 coding sequence: 
• AGC and AGT at position 2201. 
TTG and CTG at position 2430. 
30 • CCG and CTG at position 2731. 

GAA and GGA at position 3232, 
AAA and AGA at position 3667, 
TCT and TCC at position 4427, and 
AGT and GGT at position 4956; 
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(f) determining any sequence differences between said 
individual's BRCA1 coding sequences and SEQ. ID. NOl, 
SEQ. ID. N03, or SEQ. ID. N05 wherein the presence of 
said polymorphic variations and the absence of a 
5 variation outside of positions 2201, 2430, 2731, 3232, 

3667, 4427 t and 4956, is correlated with an absence of 
increased genetic susceptibility to breast or ovarian 
cancer resulting from a BRCA1 mutation in the BRCA1 
coding sequence. 

10 . 

17, A method of claim 16 wherein, codon variations occur at the 
following frequencies, respectively, in a population of individuals free of 
disease: 

• at position 2201, AGC and AGT occur at frequencies from about 
35-45%, and from about 55-65%, respectively; 

• at position 2430, TTG and CTG occur at frequencies from about 
35-45%, and from about 55-65%, respectively; 

• at position 2731, CCG and CTG occur at frequencies from about 
: 25-35%, and from about 65-75%, respectively; 

• at position 3232, GAA and GGA occur at frequencies from about 
35-45%, and from about 55-65%, respectively; 

• at position 3667, AAA and AGA occur at frequencies from about 
35-45%, and from about 55-65%, respectively; 

• at position 4427, TCT and TCC occur at frequencies from about 
45-55%, and from about 45-55%, respectively; and 

• at position 4956, AGT and GGT occur at frequencies from about 
35-45%, and from about 55-65%, respectively. 

30 18. A method according to claim 16 wherein said oligonucleotide primer is 
labeled with a radiolabel, a fluorescent label a bioluminescent label, a 
chemiluminescent label, or an enzyme label. 

19. A method of detecting a increased genetic susceptibility' to breast and 
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ovarian cancer in an individual resulting from the presence of a mutation in 
the BRCA1 coding sequence, comprising: 

(a) amplifying a DNA fragment of an individual's BRCA1 
coding sequence using an oligonucleotide primer which 
specifically hybridizes to sequences within the gene: 

(b) sequencing said amplified DNA fragment by dideoxy 
sequencing: 

(c) repeating steps (a) and (b) until said individual's BRCA1 
coding sequence is completely sequenced; 

(d) comparing the sequence of said amplified DNA fragment 
to a BRCAl< om « DNA sequence, SEQ. ID. NOl. SEQ. ID. 
N03, or SEQ. ID. N05; 

(e) determining any sequence differences between said 
individual's BRCA1 coding sequences and SEQ. ID. NOl, 
SEQ. ID. N03, or SEQ. ID. N05: to determine the 
presence or absence of base changes in said individual's 
BRCA1 coding sequence wherein a base change which is 
not any one of the following: 

AGC and AGT at position 2201. 
TTG and CTG at position 2430, 
CCG and CTG at position 273 1 . 
GAA and GGA at position 3232, 
AAA and AGA at position 3667. 
TCT and TCC at position 4427, and 
AGT and GGT at position 4956 is correlated with 
the potential of increased genetic susceptibility to 
breast or ovarian cancer resulting from a BRCA1 
mutation in the BRCA1 coding sequence. 



30 20. A method of claim 19 wherein, codon variations occur at the following 
frequencies, respectively, in a population free of disease: 

• at position 220 1 . AGC and AGT occur at frequencies from about 
40%, and from about 55-65%, respectively; 

• at position 2430. TTG and CTG occur at frequencies from about 
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35-45%, and from about 55-65%, respectively; 

• • at position 2731, CCG and CfG occur at frequencies from about 

25-35%, and from about 65-75%, respectively; 

• at position 3232, GAA and GGA occur at frequencies from about 
5 35-45%, and from about 55-65%, respectively; 

• at position 3667, AAA and AGA occur at frequencies from about 
35-45%, and from about 55-65%, respectively; 

• at position 4427, TCT and TCC occur at frequencies from about 
45-55%, and from about 45-55%, respectively; and 

10 • at position 4956, AGT and GGT occur at frequencies from about 

35-45%, and from about 55-65%, respectively. 

21. A method according to claim 19 wherein said oligonucleotide primer is 
labeled with a radiolabel, a fluorescent label a bioluminescent label, a 

15 chemiluminescent label, or an enzyme label. 

22. A set of codon pairs, which occur at polymorphic positions in a 
BRCA1 gene with a BRCA1 coding sequence according to Claim 1, wherein 
said set of codon pairs is: 

20 • AGC and AGT at position 220 1 ; 

• TTG and CTG at position 2430; 
CCG and CTG at position 2731; 
GAA and GGA at position 3232; 

• AAA and AGA at position 3667; 

25 • TCT and TCC at position 4427; and 

• AGT and GGT at position 4956. 

23. A set of at least two alternative codon pairs according to claim 22 
wherein set of at least two alternative codon pairs occur at the following 

30 frequencies: 

• at position 2201, AGC and AGT occur at frequencies of about 
40%, and from about 55-65%, respectively; 

• at position 2430, TTG and CTG occur at frequencies from about 
35-45%, and from about 55-65%, respectively; 

13 



• at position 273 1 , CCG and CTG occur at frequencies from about 
25-35%, and from about 65-75%, respectively; 

• at position 3232, GAA and GGA occur at frequencies from about 
35-45%, and from about 55-65%, respectively; 

at position 3667, AAA and AGA occur at frequencies from about 
35-45%, and from about 55-65%, respectively; 
at position 4427, TCT and TCC occur at frequencies from about 
45-55%, and from about 45-55%, respectively; and 

• at position 4956, AGT and GGT occur at frequencies from about 
35-45%, and from about 55-65%, respectively. 

24. A BRCA1 coding sequence according to claim 1 wherein the codon 
pairs occur at the following frequencies: 

at position 2201, AGC and AGT occur at frequencies of about 

40%, and from about 55-65%, respectively; 

at position 2430, TTG and CTG occur at frequencies from about 

35-45%, and from about 55-65%, respectively; 

at position 2731, CCG and CTG occur at frequencies from about 

25-35%, and from about 65-75%, respectively; 

at position 3232, GAA and GGA occur at frequencies from about 

35-45%, and from about 55-65%, respectively; 

• at position 3667, AAA and AGA occur at frequencies from about 
35-45%, and from about 55-65%, respectively; 

at position 4427, TCT and TCC occur at frequencies from about 
45-55%, and from about 45-55%, respectively; and 

• at position 4956, AGT and GGT occur at frequencies from about 
35-45%, and from about 55-65%, respectively. 

25. A method of determining the consensus genomic sequence or consensus 
coding sequence for a target gene, comprising: 

a) screening a number of individuals in a population for a family history 
which indicates inheritance of normal alleles for a target gene; 

b) isolating at least one allele of the target gene from individuals found to 
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have a family history which indicates inheritance of normal alleles for a target 
gene; 

c) sequencing each allele; 

d) comparing the nucleic acid sequence of the genomic sequence or of the 
5 coding sequence of each allele of the target gene to determine similarities and 

differences in the nucleic acid sequence; and 

e) determining which allele of the target gene occurs with the greatest 
frequency. 

10 26. A method of performing gene therapy, comprising: 

a) transfecting cancer cell in vivo with an effective amount of a 
vector transformed with a BRCA1 coding sequences of SEQ. 
ID. NO.: 1, SEQ. ID. NO.: 3, or SEQ. ID. NO.: 5; 

b) allowing the cells to take up the vector, and 
15 c) measuring a reduction in tumor growth. 

27. A method of performing protein therapy, comprising: 

a) injecting into a patient, an effective amount of BRCA1 tumor 
growth inhibiting protein of SEQ. ID. NO.: 2, SEQ. ID. NO.: 

20 4, or SEQ. ID. NO.: 6; 

b) allowing the cells to take up the protein, and 

c) measuring a reduction in tumor growth. . 

SEQUENCING 

25 Any nucleic acid specimen, in purified or non-purified form, can be 

utilized as the starting nucleic acid or acids, providing it contains, or is suspected 
of containing, the specific nucleic acid sequence containing a polymorphic locus. 
Thus, the process may amplify, for example, DNA or RNA, including messenger 
RNA, wherein DNA or RNA may be single stranded or double stranded. In the 

30 event that RNA is to be used as a template, enzymes, and /or conditions optimal 
for reverse transcribing the template to DNA would be utilized. In addition, a 
DNA-RNA hybrid which contains one strand of each may be utilized. A mixture 

15 



of nucleic acids may also be employed, or the nucleic acids produced in a" 
previous amplification reaction herein, using the same or different primers may 
be so utilized. See TABLE II. The specific nucleic acid sequence to be amplified, 
i.e., the polymorphic locus, may be a fraction of a larger molecule or can be 
5 present initially as a discrete molecule, so that the specific sequence constitutes 
the entire nucleic acid. It is not. necessary that the sequence to be amplified be 
present initially in a pure form; it may be a minor fraction of a complex mixture, 
such as contained in whole human DNA. 

10 DNA utilized herein may be extracted from a body sample, such as blood, 

tissue material and the like by a variety of techniques such as that described by 
Maniatis, et al. in Molecular Cloning:A Laboratory Manual, Cold Spring Harbor, 
NY, p 280-281, 1982). If the extracted sample is impure, it may be treated before 
amplification with an amount of a reagent effective to open the cells, or animal 

15 cell membranes of the sample, and to expose and/or separate the strand(s) of the 
nucleic acid(s). This lysing and nucleic acid denaturing step to expose and 
separate the strands will allow amplification to occur much more readily. 

The deoxyribonucleotide triphosphates dATP, dCTP, dGTP, and dTTP are 
added to the synthesis mixture, either separately or together with the primers, in 

20 adequate amounts and the resulting solution is heated to about 90M00°C from 
about 1 to 10 minutes, preferably from 1 to 4 minutes. After this heating period, 
the solution is allowed to cool, which is preferable for the primer hybridization. 
To the cooled mixture is added an appropriate agent for effecting the primer 
extension reaction (called herein "agent for polymerization")/ and the reaction is 

25 allowed to occur under conditions known in the art. The agent for 
polymerization may also be added together with the other reagents if it is heat 
stable. This synthesis (or amplification) reaction may occur at room "temperature 
up to a temperature above which the agent for . polymerization no longer 
functions. Thus, for example, if DNA polymerase is used as the agent, the 
30 temperature is generally no greater than about 40°C. Most conveniently the 
reaction occurs at room temperature. 

The primers used to carry out this invention embrace oligonucleotides of 
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. sufficient length and appropriate sequence to provide initiation of 
polymerization. Environmental conditions conducive to synthesis include the 
presence of nucleoside triphosphates and an agent for polymerization, such as 
DNA polymerase, and a suitable temperature and pH. The primer is preferably 
5 single stranded for maximum efficiency in amplification, but may be double 
stranded. If double stranded, the primer is first treated to separate its strands 
before being used to prepare extension products. The primer must be sufficiently 
long to prirfie the synthesis of extension products in the presence of the inducing 
agent for polymerization. The exact length of primer will depend on many 

10 factors, including temperature, buffer, and nucleotide composition. The 
oligonucleotide primer typically contains 12-20. or more nucleotides, although it 
may contain fewer nucleotides. 

Primers used to carry out this invention are designed to be substantially 
complementary to each strand of the genomic locus to be amplified. This means 

15 that the primers must be sufficiently complementary to hybridize with their 
respective strands under conditions which allow the agent for polymerization to 
perform. In other words, the primers should have sufficient complementarity 
.with the 5' and 3' sequences flanking the mutation to hybridize therewith and 
permit amplification of the genomic locus. 

20 Oligonucleotide primers of the invention are employed in the 

amplification process which is an enzymatic chain reaction that produces 
exponential quantities of polymorphic locus relative to the number of reaction 
steps involved. Typically, one primer is complementary to the negative (-) 
strand of the polymorphic locus and the other is complementary to the positive 

25 (+) strand. Annealing the primers to denatured nucleic acid followed by 
extension with an enzyme, such as the large fragment of DNA polymerase I 
(Klenow) and nucleotides, results in newly synthesized + and - strands 
containing the target polymorphic locus sequence. Because these newly 
synthesized sequences are also templates, repeated cycles of denaturing, primer 

30 annealing, and extension results in exponential production of the region (i.e., the 
target polymorphic locus sequence) defined by the primers. The product of the 
chain reaction is a discreet nucleic acid duplex with termini corresponding to the 
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ends of the specific primers employed. 

The oligonucleotide primers of the invention may be prepared using any 
suitable method, such as conventional' phosphotriester and phosphodiester 
methods or automated embodiments thereof. In one such automated 
embodiment, diethylphosphoramidites are used as starting materials and may be 
synthesized as described by Beaucage, et al., Tetrahedron Letters, 22:1859-1862, 
1981. One method for synthesizing oligonucleotides on a modified solid support 
is described in U.S. Patent No. 4,458,066. 

The agent for polymerization may be any compound or system which will 
function to accomplish the synthesis of primer extension products, including 
enzymes. Suitable enzymes for this purpose include, for example, £. coli DNA 
polymerase I, Klenow fragment of E. coli DNA polymerase, polymerase muteins, 
reverse transcriptase, other enzymes, including heat-stable enzymes (e.i., those 
enzymes which perform primer extension , after being subjected to temperatures 
sufficiently elevated to cause denaturation), such as Taq polymerase. Suitable 
enzyme will facilitate combination of the nucleotides in the proper manner to 
form the primer extension products which are complementary to each 
polymorphic locus nucleic acid strand. Generally, the synthesis will be initiated 
at the 3' end of each primer and proceed in the 5' direction along the template 
strand, until synthesis terminates, producing molecules of different lengths. 

The newly synthesized strand and its complementary nucleic acid strand 
will form a double-stranded molecule under hybridizing conditions described 
above and this hybrid is used in subsequent steps of the process. In the next step, 
the newly synthesized double-stranded molecule is subjected to denaturing 
conditions using any of the procedures described above to provide single- 
stranded molecules. 

The steps of denaturing, annealing, and extension product synthesis can be 
repeated as often as needed to amplify the target polymorphic locus nucleic acid 
sequence to the extent necessary for detection. The amount of the specific nucleic 
acid sequence produced will accumulate in an exponential fashion. 
Amplification is described in PCR. A Prar tical Approach . ILR Press, Eds. M. J. 
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McPherson, P. Quirke, and G. R. Taylor, 1992. 

The amplification products may be detected by Southern blots analysis, 
without using radioactive probes. In such a process, for example, a small sample 
of DNA containing a very low level of the nucleic acid sequence of the 
5 polymorphic locus is amplified, and analyzed via a Southern blotting technique 
or similarly, using dot blot analysis. The use of non-radioactive probes or labels 
is facilitated by the high level of the amplified signal. Alternatively, probes used 
to detect the amplified products can be directly or indirectly detectably labeled, for 
example, with a radioisotope, a fluorescent compound, a bioluminescent 

10 compound, a chemiluminescent compound, a metal chelator or an enzyme. 
Those of ordinary skill in the art will know of other suitable labels for binding to 
the probe, or will be able to ascertain such, using routine experimentation. 

Sequences amplified by the methods of the invention can be further 
evaluated, detected, cloned, sequenced, and the like, either in solution or after 

15 binding to a solid support, by any method usually applied to the detection of a 
specific DNA sequence such as PCR, oligomer restriction ) (Saiki, etal, 
Bio/Technology &\m~lOYl, 1985), allele-specific oligonucleotide (ASO) probe 
analysis (Conner, et al, Proc. Natl. Acad. Sci. U.S.A., fifi:278, 1983), 
oligonucleotide ligation assays (OLAs) (Landgren, et al, Science ,££1:1007, 1988), - 

20 and the like. Molecular techniques for DNA analysis have been reviewed 
(Landgren, et al, Science ,212:229-237, 1988). 

Preferably, the method of amplifying is by PCR, as described herein and as 
is commonly used by those of ordinary skill in the art Alternative methods of 
amplification have been described and can also be employed as long as the 

25 BRCA1 locus amplified by PCR using primers of the invention is similarly 
amplified by the alternative means. Such alternative amplification systems 
include but are not limited to self-sustained sequence replication, which begins 
with a short sequence of RNA of interest and a T7 promoter. Reverse, 
transcriptase copies the RNA into cDNA and degrades the RNA, followed by 

30 reverse transcriptase polymerizing a second strand of DNA. Another nucleic 
acid amplification technique is nucleic acid sequence-based amplification 
(NASBA) which uses reverse transcription and T7 RNA polymerase and 
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incorporates two primers to target its cycling scheme. " NASBA can begin with 
either DNA or RNA and finish with either, and amplifies to 10 8 copies within 60 
to 90 minutes. Alternatively, nucleic acid can be amplified by ligation activated 
transcription (LAT). LAT works from a single-stranded template with a single 
5 primer that is partially single-stranded and partially double-stranded. 
Amplification is initiated by ligating a cDNA to the promoter oligonucleotide 
and within a few hours, amplification is 10 8 to 10 9 fold. Another amplification 
system useful in the method of the invention is the QB Replicase System. The 
QB replicase system can be utilized by attaching an RNA sequence called MDV-1 

10 to RNA complementary to a DNA sequence of interest. Upon mixing with a 
sample, the hybrid RNA finds its complement among the specimen's mRNAs 
and binds, activating the replicase to copy the tag-along sequence of interest. 
Another nucleic acid amplification technique, ligase chain reaction (LCR), works 
by using two differently labeled halves of a sequence of interest which are 

15 covalently bonded by ligase in the presence of the contiguous sequence in a 
sample, forming a new target. The repair chain reaction (RCR) nucleic acid 
amplification technique uses two complementary and target-specific 
oligonucleotide probe pairs, thermostable polymerase and ligase, and DNA 
nucleotides to geometrically amplify targeted sequences. A 2-base gap separates 

20 the oligonucleotide probe pairs, and the RCR fills and joins the gap, mimicking 
DNA repair. Nucleic acid amplification by strand displacement activation (SDA) 
utilizes a short primer containing a recognition site for hincll with short 
overhang on the 5' end which binds to target DNA. A DNA polymerase fills in 
the part of the primer opposite the overhang with sulfur-containing adenine 

25 analogs. Hindi is added but only cuts the unmodified DNA strand. A DNA 
polymerase that lacks 5' exonuclease activity enters at the cite of the nick and 
begins to polymerize, displacing the initial primer strand downstream and 
building a new one which serves as more primer. SDA produces greater than 
10 7 -fold amplification in 2 hours at 37°C. Unlike PGR and LCR, SDA does not 

30 require instrumented Temperature cycling. 

Another method is a process for amplifying nucleic acid sequences from a 
DNA or RNA template which may be purified or may exist in a mixture of 
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nucleic acids. The resulting nucleic acid sequences may be exact copies of the 
template, or may be modified. The process* has advantages over PCR in that it 
increases the fidelity of copying a specific nucleic acid sequence, and it allows one 
to more efficiently detect a particular point mutation in a single assay. A target 
5 nucleic acid is amplified enzymatically while avoiding strand displacement. 
Three primers are used. A first primer is complementary to the first end of the 
target. A second primer is complementary to the second end of the target. A 
third primer which is similar to the first end of the target and which is 
substantially complementary to at least a portion of the first primer such that 
10 when the third primer is hybridized to the first primer, the position of the third 
primer complementary to the base at the 5' end of the first primer contains a 
modification which substantially avoids strand displacement. This method is 
detailed in U.S. Patent 5,593,840 to Bhatnagar et al. 1997. Although PCR is the 
preferred method of amplification if the invention, these other methods can also 
15 be used to amplify the BRCA1 locus as described in the method of the invention. 

The BRCAl(° mi ) DNA coding sequences were obtained by end to end 
sequencing of the BRCA1 alleles of five subjects in the manner described above 
followed by analysis of the data obtained. The data obtained provided us with the 
opportunity to evaluate seven previously published polymorphisms and to 
20 affirm or correct where necessary, the frequency of occurrence of alternative 
codons. 

GENE THERAPY 

The coding sequences can be used for gene therapy. 
A variety of methods are known for gene transfer, any of which might be 
25 available for use. 

Direct injection of Recombinant DNA in vivo 

1. Direct injection of "naked" DNA directly with a syringe and needle into a 
specific tissue, infused through a vascular bed, or transferred through a catheter 
into endothelial cells. 

30 2. Direct injection of DNA that is contained in artificially generated lipid 
vesicles. 

3. Direct injection of DNA conjugated to a targeting structure, such as an 
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antibody. 

4. Direct injection by particle bombardment, where the DNA is coated onto 
gold particles and shot into the cells. 

5 Human Artificial Chromosomes 

This novel gene delivery approach involves the use of human chromosomes 
that have been striped down to contain only the essential components for 
replication and the genes desired for transfer. 

10 Receptor-Mediated Gene Transfer 

DNA is linked to a targeting molecule that will bind to specific cell-surface 
receptors, inducing endocytosis and transfer of the DNA into mammalian' cells. 
One such technique uses poly-L-lysine to link asialoglycoprotein to DNA. An 
adenovirus is also added to the complex to disrupt the lysosomes and thus allow 

15 the DNA to avoid degradation and move to the nucleus. Infusion of these 
particles intravenously has resulted in gene transfer into hepatocytes. 

RECOMBINANT VIRUS VECTORS 

Several vectors are used in gene therapy. Among them are the Moloney Murine 
20 Leukemia Virus (MoMLV) Vectors, the adenovirus vectors, the adeno- 
Associated Virus (AAV) vectors, the herpes simplex virus (HSV) vectors, the 
poxvirus vectors, and human immunodeficiency virus (HIV) vectors, 

GENE REPLACEMENT AND REPAIR 

25 The ideal genetic manipulation for treatment of a genetic disease would be the 
actual replacement of the defective gene with a normal copy of the gene. 
Homologous recombination is the term used for switching out a section of DNA 
and replacing it with a new piece. By. this technique, the defective gene can be 
replaced with a normal gene which expresses a functioning BRCA1 tumor 

30 growth inhibitor protein. 

A complete description of gene therapy can also, be found in "Gene Therapy A 
Primer For Physicians 2d Ed. by Kenneth W. Culver, M.D. Publ. Mary Ann 
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Liebert Inc. (1996). Two Gene Therapy Protocols for BRCA1 are approved by the 
Recombinant DNA Advisory Committee for Jeffrey T. Holt et al. They are listed 
as 9602-148, and 9603-149 and are available from the NIH. The isolated BRCA1 
gene can be synthesized or constructed from amplification products and inserted 
5 into a vector such as the LXSN vector. 

The BRCA1 amino acid and nucleic acid sequence may be used to make 
diagnostic, probes and antibodies. Labeled diagnostic probes may be used by any 
hybridization method to determine the level of BRCA1 protein in serum or lysed 
cell suspension of a patient, or solid surface cell sample. 
10 The BRCA1 amino acid sequence may be used to provide a level of 

protection for patients against risk of breast or ovarian cancer or to reduce the 
size of a tumor. Methods of making and extracting proteins are well known. 
Itakura et al U.S. Patents 4,704,362, 5, 221, 619, and 5,583,013. BRCA1 has been 
shown to be secreted. Jensen, R.A. et al Nature Genetics 12: 303-308 (1996). 

15 

EXAMPLE 1 

Determination Of The Coding Sequence Of A BRCAli flmi l Gene From Five 
Individuals 

20 MATERIALS AND METHODS 

Approximately 150 volunteers were screened in order to identify 
individuals with no cancer history in their immediate family (i.e. first and 
second degree relatives). Each person was asked to fill out a hereditary cancer 
prescreening questionnaire See TABLE I below. Five of these were randomly 

25 chosen for end-to-end sequencing of their BRCA1 gene. A first degree relative is 
a parent, sibling, or offspring. A second degree relative is an aunt, uncle, 
grandparent, grandchild, niece, nephew, or half-sibling. 
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TABLE I 

Hereditary Cancer Pre-Screening Questionnaire 

Fart A; Answer the followine questions about vour family 

1. To your knowledge, has anyone in your family been diagnosed with a very specific 
5 hereditary colon disease called Familial Adenomatous Polyposis (FAP)? 

2. To your knowledge, have you or any aunt had breast cancer diagnosed before the age 35? 

3. Have you had Inflammatory Bowel Disease, also called Crohn's Disease or Ulcerative 
Colitis, for more than 7 years? 

Part B: Refer to the list of cancers belo w for vour responses only to questions in Part B 
10 Bladder Cancer Lung Cancer Pancreatic Cancer 

Breast Cancer Gastric Cancer Prostate Cancer 

Colon Cancer Malignant Melanoma Renal Cancer 

Endometrial Cancer Ovarian Cancer Thyroid Cancer 

4. Have your mother or father, your sisters or brothers or your children had any of the listed 
15 cancers? 

5 Have there been diagnosed in your mother 's brothers or sisters, or your mother 's parents 

more than one of the cancers in the above list? 
6. Have there been diagnosed in your father 's brothers or sisters, or your father' s parents 
more than one of the cancers in the above list? 
20 fart Q Refer to the list of relatives below for responses only to ques t ions in Part C 
You Your mother 

Your sisters or brothers Your mothers's sisters or brothers (maternal aunts and uncles) 
Your children Your mother's parents (maternal grandparents) 

7 Have there been diagnosed in these relatives 2 or more identical types of cancer? 
25 Do not count "simple* skin cancer, also called basal cell or squamous cell skin cancer. 

8. Is there a total of 4 or more of any cancers in the list of relatives above other than 
"simple" skin cancers? 

Fait P; Refer to the list of relatives below for resp onses only to questions in Part D. 
You Your father 

30 Your sisters or brothers Your fathers's sisters or brothers (paternal aunts and uncles) 

Your children Your father's parents (paternal grandparents) 

9. Have there been diagnosed in these relatives 2 or more identical types of cancer? 
Do not count "simple" skin cancer, also called basal cell or squamous cell skin cancer. 

10. Is there a total of 4 or more of any cancers in the list of relatives above other than "simple" 
35 skin cancers? 

© Copyright 1996, OncorMed, Inc. 
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Genomic DNA was isolated from white blood cells of five subjects selected 
from analysis of their answers to the questions above. Dideoxy sequence analysis 
was performed following polymerase chain reaction amplification. 

All exons of the BRCA1 gene were subjected to direct dideoxy sequence 
analysis by asymmetric amplification using the polymerase chain reaction (PCR) 
to generate a single stranded product amplified from this DNA sample. 
Shuldiner, et al, Handbook of Techniques, in Endocrine Research, p. 457-486, 
DePablo,F., Scanes, C, eds., Academic Press, Inc., 1993. Fluorescent dye was 
attached for automated sequencing using the Taq Dye Terminator® Kit (Perkin- 
Elmer cat# 401628): DNA sequencing was performed in both forward and reverse 
directions on an Applied Biosystems, Inc. (ABI) automated Model 377® 
sequencer. The software used for analysis of the resulting data was Sequence 
Navigator® software purchased through ABI. 

l - Polymerase Chain Reacti on (PCR) Ampjif j^ff^ 

Genomic DNA (100 nanograms) extracted from white blood cells of five 
subjects. Each of the five samples was sequenced end to end. Each sample was 
amplified in a final volume of 25 microliters containing 1 microliter (100 
nanograms) genomic DNA, 2.5 microliters 10X PCR buffer (100 mM Tris, pH 8.3, 
500 mM KC1, 1.2 mM MgCl 2 ), 2.5 microliters 10X dNTP mix (2 mM each 
nucleotide), 2.5 microliters forward primer, 2.5 microliters reverse primer, and 1 
microliter Taq polymerase (5 units), and 13 microliters of water. 

The primers in Table H, below were used to carry out amplification of the 
various sections of the BRCA1 gene samples. The primers were synthesized on 
an DNA/RNA Model 394® Synthesizer. 
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TABLE E 

BRCA1 PRIMERS AND SEQUENCING DATA 

SEQ.ID 

EXQN SEQUENCE NO. MER Mg++ SIZE 

5 ~ 

EX0N2 2F 5' GAA GTT GTC ATT TTA TAA ACC TTT-3' 7 24 1.6 -275 

2R 5* TGT CTT TTC TTC CCT AGT ATG T-3' 8 22 

• EX0N3 3F 5' TCC TGA CAC AGC AGA CAT TTA-3' 9 21 1.4 -375 

10 3R 5* TTG GAT TTT CGT TCT CAC TTA-3' 10 21 

EX0N5 5F 5' CTC TTA AGG GCA GTT GTG AG-3' 1 1 2 0 1.2 -275 

5R 5* TTC CTA CTG TCG TTG CTT OC 12 201 

15 . EXON 6 6/7F 5' CTT ATT TTA GTG TCC TTA AAA GG-3' 13 23 1.6 -250 

6R 5* TTT CAT GGA CAG CAC TTG AGT G-3' 14 22 



20 



35 



50 



EX0N7 7F 5' CAC AAC AAA GAG CAT ACA TAG GG-3' 15 2 3 1.6 -275 

6/7R 5' TCG GCT TCA CTC TGT AGA AG-3' 16 20 

EX0N8 8F1 5* TTC TCT TCA GGA GGA AAA GCA-3' 17 21 1.2 -270 

8R1 5* GCT GOC TAC CAC AAA TAC AAA-3* 18 21 



EXON9 9F 5' CCA CAG TAG ATG CTC AGT ^AATA-3* 19 23 1.2 -250 

25 9R 5* TAG GAA AAT ACC AGC TTC ATA GA-3' 20 23 

EXOM10 10F 5' 1GG TCA GCT TTC TGT AAT CG-3' 21 20 1.6 -250 

10R 5' GTA TCT ACC CAC TCT CTT CTT CAG-3' 22 24 

30 EXON 11A11AF 5' CCA CCT CCA AGG TGT ATC A-3' 23 19 1.2 372 

11AR 5' TGT TAT GTT GOC TCC TTG CT-3' 24 20 



EXON11B11BF1 5' CAC TAA AGA CAG AAT GAA TCT A-3; 25 21 1.2 -400 

11BR1 5' GAA GAA CCA GAA TAT TCA TCT A-3 1 26 21 

EXON11C11CF1 5* TGA TGG GGA GTTC TGA ATC AA-3' 27 '20 1.2 -400 

11CR1 5* TCT GCT TTC TTG ATA AAA TCC T-3 1 28 22 



EXON11D11DF1 5' AGC GTC OCC TCA CAA ATA AA-3' 29 20 1.2 -400 

40 11DR1 5' TCA AGC GCA TGA ATA TGC CT-3' 30 20 

EXON11E11EF 5' GTA TAA GCA ATA TGG AAC TCG A-3' 31 22 1.2 388 

11ER 5' TTA AGT TCA CTG GTA TTT GAA CA-3' 3 2 23 

45- EXON 11F11FF 5' GAC AGC GAT ACT TTC CCA GA-3' 33 20 1.2 382 

11FR 5' TOG AAC AAC CAT GAA TTA GTC-3' 34 21 



EXON 11G11GF 5' GGA AGT TAG CAC TCT AGG GA-3' 35 20 1.2 423 

11GR 5' GCA GTO ATA TTA ACT GTTC TGT A-3' 36 22 



1 M13 tailed 



26 






EXON11H11HF 

1 1UQ 

i inn 


5* 

5 


TOG 


GTC 

Gur 


CTT 
GAC 


AAA 
ATT 


GAA ACA 
GAA TCT 


. AAGT-3' 
TCC-3' 


37 
38 


22 
21 


1.2 


366 


5 


EXON11I 1-1 IF 
1 1 in 

i i in 


5' 
o 


CCA 


CTT 

GGA 


TTT 

TGC 


CCC 
TTA 


ATC AAG 
CAA TTA 


" TCA-3' 
CTT C-3' 


39 
40 


21 
21 


1.2 


377 




EXON 11J1 1JF 
11JR 


5' 
5' 


CAA 

t vO 


AAT 
GTA 


TGA 
ACC 


ATG 
CTG 


CTA TGC 
AGC CAA 


TTA GA-3' 
AT-3' 


41 
42 


23 
20 


1.2 


377 


10 


EXON 1 1 K 1 1 KF 
11KR- 


5* 

'15' 


GCA 
TAT 


AAAGCG 
TTG CAG 


TCC 

TO A 


AGA AAG 
AGT CTT 


GA-3 1 
CCA A-3' 


43 
44 


20 
22 


1.2 


396 


15 


EXON11L11LF-1 5 1 
11LR 5' 


GTA 

TAA 

1 MM 


ATA 
AAT 


TTG 
GTO 


GCA 

CTC 


AAG GCA 
CCC AAA 


TCT-3' 
AGC A-3* 


45 
46 


22 
22' 


1.2 


360 


EXON 12 12F 
12R 


5* 
5' 


GTC 

TrtT 
1 O I 


ere 

CAG 


CCA 
CAA 


ATG 
ACC 


AGA AGA 
TAA GAA 


AA-3' 
TGT-3* 


47 
48 


20 
21 


1.2 


-300 


20 


EXON 13 13F 
13R 


5* 
5' 


AAT 

ATP* 


GGA 
TTG 


AAG 
GAG 


CTT 
CTA 


CTC AAAGTA-3' 
GCT CCT TAC-3' 


49 
50 


21 
21 


1.2 


-325 




EXON 14 14F 
14R 


5' 
5' 


CTA 

PTTT3 


ACC 
TAT 


TGA ATT 
AAATGC 


ATC ACT 
CTG TAT 


ATC A-3' 
GCA-3' 


51 
52 


22 
21 


1.2 


-310 


25 


EXON 15 15F 
15R 


5' 
5' 


TOG 

A AO 

A AO 


CTG 
CAG 


CCC 
AAT 


AGG 
ATC 


AAG TAT 
TTT ATG 


G-3' 

TAG GA-3* 


53 
54 


19 
23 


1.2 


-375 


30 


EXON 16 16F 
16R 


5' 
5' 


AAT 

AAA 
MAM 


TCT 
ACT 


TAA 
CTT 


CAG 
TCC 


AGA CCA 
AGA ATG 


GAA C-3 1 
TTG T-3' 


55 
56 


22 
22 


.1.6 


-550 




EXON 17 17F 
17R 


5' 
5» 


GTO 

TPP. 
loo 


TAG 
CCT 


AAC 
CAT 


GTO 
GTO 


CAG GAT TG-3* 
GTT TTA- 3' 


57 
58 


20 
18 


1.2 


-275 


35' 


EXON 18 18F 
18R 


5' 
5* 


QGC 


TCT 


TTA 
ATT 


GCT 
TTC 


TCT TAG 
CCA GCA 


GAC-3' 
TC-3 1 


- 59 
60 


21 
20 


1.2 


-350 




EXON 19 19F 
19R 


5' 
5' 


CTG 

PAT 

1 


TCA 
TGT 


TTC 
TAA 


TTC 
GGA 


CTG TGC 
AAG TOG 


TC-3 1 
TGC-3' 


61 
62 


20 
21 


1.2 


-250 


40 


EXON 20 20F 
20R 


5' 
5 1 


ATA 
\rlX3 


TGA 
AAT 


CGT 
CCA 


GTC 
AAT 


TGC TCC 
TAC ACA 


AC-3' 
GC-3' 


. 63 
64 


20 
20 


1.2 


-425 


45 


EXON 21 21 F 
21R 


5' 
5* 


AAG 

U 1 A 


CTC 
GAG 


TTC 
AAA 


CTT 
TAG 


TTT GAA 
AAT AGC 


AGT C-3* 
CTC T-3' 


65 
66 


22 
22 


1.6 


-300 




EXON 22 22F 
22R 


5* 
5' 


TCC 
GAG 


CAT 
AAG 


TGA 
ACT 


GAG 
TCT 


GTC TTG 
GAG GCT 


CT-3' 
AC-3* 


67 
68 


20 
20 


1 .6 


-300 


50 


EXON 23 23F-1 
23R-1 


5' 
5' 


TGA 
CAT 


AGT 
TTT 


GAC 
AGC 


AGT 
CAT 


TCC AGT 
TCA TTC 


AGT-3 1 
AAC AA-3' 


69 

70 . 


21 
23 


1.2 


-250 




EXON 24 24F 
24R 


5' 
5' 


ATG 
GTA 


AAT 
GX 


TGA 
AGG 


CAC 
ACA 


TAA TCT 
GTA GAA 


CTG C-3' 
GGA-3' 


71 
72 


22 
21 


1 .4 


-285 
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Thirty-five cycles were performed, each consisting of denaturing (95°C; 30 
seconds), annealing (55°C; 1 minute), and extension (72°C; 90 seconds), except during 
the first cycle in which the denaturing time was increased to 5 minutes, and. during the 
last cycle in which the extension time was increased to 5 minutes. 
5 PCR products were purified using Qia-quick® PCR purification kits (Qiagen cat# 

28104; Chatsworth, CA). Yield and purity of the PCR product determined 
spectrophotometrically at OD 2 6o on a Beckman DU 650 spectrophotometer. 

2. Dideoxy Sequence Analysis 

Fluorescent dye was attached to PCR products for automated sequencing using the Taq 
10 Dye Terminator® Kit (Perkin-EImer cat# 401628). DNA sequencing was performed in 
both forward and reverse directions on an Applied Biosystems, Inc. (ABI) Foster City, 
CA., automated Model 377® sequencer. The software used for analysis of the resulting 
data was "Sequence Navigator® software" purchased through ABL 

3. RESULTS 

15 Differences in the nucleic acids of the ten alleles from five individuals were found in 
seven locations on the gene. The changes and their positions are found on TABLE HI, 
below. 
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5 AMINO Nucleotide 
ACID CHANGE Change 1 



TABLE III 
PANEL TYPING 



FREQUENCY 



10 



SER(SER) 
(694) 

LEU(LEU) 
(771) 



11E 



11F 



C/C C/T C/T T/T T/T 



T/T C/T C/T C/C C/C 



0.4 C 
0.6 T 

0.4 T 
0.6 C 



PRO(LEU) 
15 (871) 



11G 



C/T C/T C/T T/T T/T 



0.3 C 
0.7 T 



GLU(GLY) 
(1038) 



111 



A/ A A/G A/G G/G G/G 



0.4 A 
0.6 G 



20 LYS(ARG) 
(1183) 



1 1J 



A/ A A/G A/G G/G G/G 



0.4 A 
0.6 G 



25 



SER(SER) 
(1436) 

SER(GLY) 
(1613) 



13 



16 



T/T T/T T/C C/C C/C 



A/ A A/G A/G G/G G/G 



0.5 T 
0.5 C 

0.4 A 
0.6 G 
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Tables 3 and 4 depict one aspect of the invention, sets of at least two alternative 
codon pairs wherein the codon pairs occur' in the following frequencies, 
respectively, in a population of individuals free of disease: 

at position 2201, AGC and AGT occur at frequencies from about 35- 
45%, and from about 55-65%, respectively; 

at position 2430, TTG and CTG occur at frequencies from about 35- 
45%, and from about 55-65%, respectively; 

at position 2731, CCG and CTG occur at frequencies from about 25- 
35%, and from about 65-75%, respectively; 

at position 3232, GAA and GGA occur at frequencies from about 35- 
45%, and from about 55-65%, respectively; 

at position 3667, AAA and AGA occur at frequencies from about 35- 
45%, and from about 55-65%, respectively; 

at position 4427, TCT and TCC occur at frequencies from about 45- 
55%, and from about 45-55%, respectively; and 
at position 4956, AGT and GGT occur at frequencies from about 35- 
45%, and from about 55-65%, respectively. 



The data show that for each of the samples. The BRCA1 gene is identical except 
20 in the region of seven polymorphisms. These polymorphic regions, together with their 
locations, the amino acid groups of each codon, the frequency of their occurrence and 
the amino acid coded for by each codon are found in TABLE IV below. 

TABLE IV ' 

25 CODON AND BASE CHANGES IN SEVEN POLYMORPHIC SITES OF BRCA1 GENE 

SAMPLE BASE ■ POSmON CCOCN AA PUBLISHED FREQUENCY 

NAME CHANGE nt/aa EXCN CHANGE CHANGE FREQUENCY 2 IN THIS STUDY 
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2,3,4,5 C-T 2201/694 11E AGC(AGT) SER-SER UNPUBUSHED C=40% 



2,3,4,5 T-C 2430/771 1 1 F TTG (CTG) LEU-LEU T=67% 13 



T=40^ 



1,2,3,4,5 C-T 2731/871 11G CCG(CTG) PRO-LEU C=34% 12 



C=30% 
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2,3,4,5 A-G 3232/1038 111 GAA(GGA) GLU-GLY A=67% 13 



A=40% 



2,3,4,5 A-G 3667/1183 1 1 J AAA(AGA) LYS-ARG A=68% 12 



A=40% 



5 3.4,5 T-C 4427/1436 13 TCT(TCC) SER-SER T=67% t2 



T=50% 



2,3,4,5 A-G 4956/1613 16 AGT(GGT) SER-GLY A=67% 12 



A =40% 



2 Reference numbers correspond to the Table of References below. 
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EXAMPLE 2 

Determination Of A Individual Using BRCAlI QMI l And The Seven Polymorp hism* 
For Reference 

A person skilled in the art of genetic susceptibility testing will find the present 
5 invention useful for: 

a) identifying individuals having a BRCA1 gene, who are therefore have no 
elevated genetic susceptibility to breast or ovarian cancer from a BRCA1 
mutation; 

b) avoiding misinterpretation of polymorphisms found in the 
10 BRCA1 gene; 

Sequencing is carried out as in EXAMPLE 1 using a blood sample from the patient in 
question. However, a BRCAK omi ) sequence is used for reference and the polymorphic 
sites are compared to the nucleic acid sequences listed above for codons at each 
polymorphic site. A sample is one which compares to a BRCAl(° mi > sequence and 
15 contains one of the base variations which occur at each of the polymorphic sites. The 
codons which occur at each of the polymorphic sites are paired here reference. 

AGC and AGT at position 2201, 

TTG and CTG at position 2430, 

• CCG and CTG at position 2731, 
20 • GAA and GG A at position 3232, 

• AAA and AGA at position 3667, 

• TCT and TCC at position 4427, and 
AGT and GGT at position 4956. 

The availability of these polymorphic pairs provides added assurance that one skilled in 
25 the art can correctly interpret the polymorphic variations without mistaking a 
variation for a mutation. 

Exon 11 of the BRCA1 gene is subjected to direct dideoxy sequence analysis by 
asymmetric amplification using the polymerase chain reaction (PCR) to generate a 
single stranded product amplified from this DNA sample. Shuldiner, et al, Handbook 
30 of Techniques in Endocrine Research, p. 457-486, DePablo,F., Scanes, C., eds., Academic 
Press, Inc., 1993. Fluorescent dye is attached for automated sequencing using the Taq 
Dye Terminator® Kit (Perkin-Elmer cat# 401628). DNA sequencing is performed in 
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both forward and reverse directions on an Applied Biosystems, Inc. (ABI) automated 
Model 377® sequencer. The software used for analysis of the resulting data is "Sequence 
Navigator® software" purchased through ABI. 

5 1. Polymerase Chain Reaction (PCR) Amplification 

Genomic DNA (100 nanograms) extracted from white blood cells of the subject is 
amplified in a final volume of 25 microliters containing 1 microliter (100 nanograms) 
genomic DNA, 2.5 microliters 10X PCR buffer (100 mM Tris, pH 8.3, 500 mM KC1, 1.2 
mM MgCl 2 )/ 2.5 microliters 10X dNTP mix (2 mM each nucleotide), 2.5 microliters 

10 . forward primer (BRCA1-11K-F, 10 micromolar solution), 2.5 microliters; reverse primer 
(BRCA1-11K-R, 10 micromolar solution),and 1 microliter Taq polymerase (5 units), and 
13 microliters of water. 

The PCR primers used to amplify a patient's sample BRCA1 gene are listed in Table II. 
The primers were synthesized on an DNA/RNA Model 394® Synthesizer. Thirty-five 
15 cycles are of amplification are performed, each consisting of denaturing (95°C; 30 
seconds), annealing (55°C; 1 minute), and extension (72°C; 90 seconds), except during 
the first cycle in which the denaturing time is increased to 5 minutes, and during the 
last cycle in which the extension time is increased to 5 minutes. 

PCR products are purified using Qia-quick® PCR purification kits (Qiagen, cat# 28104; 
20 Chatsworth, CA). Yield and purity of the PCR product determined 
spectrophotometrically at OD 2 6o on a Beckman DU 650 spectrophotometer. 

2. Dideoxy Sequence Analysis 

Fluorescent dye is attached to PCR products for automated sequencing using the Taq - 
25 Dye Terminator® Kit (Perkin-Elmer cat# 401628). DNA sequencing is performed in 
both forward and reverse directions on an Applied Biosystems, Inc. (ABI) Foster City, 
CA., automated Model 377® sequencer. The software used for analysis of the resulting 
data is "Sequence Navigator® software" purchased through ABI. The BRCAl( omil ) SEQ. 
ID. NO.:l sequence is entered into the Sequence Navigator® software as the Standard for 
30 comparison. The Sequence Navigator® software compares the sample sequence to the 
BRCAl(° mil ) SEQ. ID. NO.:l standard, base by base. The Sequence Navigator® software 
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highlights all differences between the BRCAK° m >U SEQ. ID. N0.:1 DNA sequence and 
the patient's sample sequence. 

A first technologist checks the computerized results by comparing visually the 
BRCAl(°n\il) SEQ. ID. NO.:l standard against the patient's sample, and again highlights 
5 any differences between the standard and the sample. The first primary technologist 
then interprets the sequence variations at each position along the sequence. 
Chromatograms from each sequence variation are generated by the Sequence 
Navigator® software and printed on a color printer. The peaks are interpreted by the 
first primary technologist and a second primary technologist. A secondary technologist 
10 then reviews the chromatograms. The results are finally interpreted by a geneticist. In 
each instance, a variation is compared to known polymorphisms for position and base 
change. If the sample BRCA1 sequence matches the BRCAK 0 ™*) SEQ. ID. NO.:l 
standard, with only variations within the known list of polymorphisms, it is 
interpreted as a gene sequence. 

15 

EXAMPLE 3 

DETERMINING THE ABSENCE OF A MUTATION IN TH E BRCA1 GENE TISTMn 
BBCAlkmm AND SEVEN POLYMORPHISM S FOR REFERENCE 

A person skilled in the art of genetic susceptibility testing will find the present 
20 invention useful for determining the presence of a known or previously unknown 
mutation in the BRCA1 gene. A list of mutations of BRCA1 is publicly available in the 
Breast Cancer Information Core at: 
^ http:/ /www.nchgr.nih.gov/dir/lab_transfer/bic. This data, site became publicly 
available on November 1, 1995. Friend, S. et al. Nature Genetics 11:238, (1995). 
25 Sequencing is carried out as in EXAMPLE 1 using a blood sample from the patient in 
question. However, a BRCAK°™) sequence is used for reference and polymorphic sites 
are compared to the nucleic acid sequences listed above for codons at each polymorphic 
site. A sample is one which compares to the BRCAK 00 ^ SEQ. ID. NO.: 3 sequence and 
contains one of the base variations which occur at each of the polymorphic sites. The 
30 codons which occur at each of the polymorphic sites are paired here reference. 
AGC and AGT at position 2201, 
TTG and CTG at position 2430, 
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CCG and CTG at position 2731, 

• GAA and GGA at position 3232, 

• AAA and AGA at position 3667, 

• TCT and TCC at position 4427, and 
5 • AGT and GGT at position 4956. 

The availability of these polymorphic pairs provides added assurance that one skilled in 
the art can correctly interpret the polymorphic variations without mistaking a 
variation for a mutation. 

Exon 11 of the BRCA1 gene is subjected to direct dideoxy sequence analysis by 

10 asymmetric amplification using the polymerase chain reaction (PCR) to generate a 
single stranded product amplified from this DNA sample. Shuldiner, et al, Handbook 
of Techniques in Endocrine Research, p. 457-486, DePablo,F., Scanes, C, eds., Academic 
Press, Inc., 1993. Fluorescent dye is attached for automated sequencing using the Taq 
Dye Terminator® Kit (Perkin-Elmer cat# 401628). DNA sequencing is performed in 

15 both forward and reverse directions on an Applied Biosystems, Inc. (ABI) automated 
Model 377® sequencer. The software used for analysis of the resulting data is "Sequence 
Navigator® software" purchased through ABI. 

1. Polymerase Chain Rea ction fPCR^ Amplification 

20 Genomic DNA (100 nanograms) extracted from white blood cells of the subject is 
amplified in a final volume of 25 microliters containing 1 microliter (100 nanograms) . 
genomic DNA, 2.5 microliters 10X PCR buffer (100 mM Tris, pH 8.3, 500 mM KC1, 1.2 
mM MgCl 2 ), 2.5 microliters 10X dNTP mix (2 mM each nucleotide), 2.5 microliters 
forward primer (BRCA1-11K-F, 10 micromolar solution), 2.5 microliters reverse primer 

25 (BRCA1-11K-R, 10 micromolar solution),and 1 microliter Taq polymerase (5 units), and 
13 microliters of water. 

The PCR primers used to amplify a patient's sample BRCA1 gene are listed in Table II. 
The primers were synthesized on an DNA/RNA Model 394® Synthesizer. Thirty- five 
cycles are of amplification are performed, each consisting of denaturing (95 3 C; 30 
30 seconds), annealing (55°C; 1 minute), and extension (72°C; 90 seconds), except during 
the first cycle in which the denaturing time is increased to 5 minutes, and during the 
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last cycle in which the extension time is increased to 5 minutes. 

PCR products are purified using Qia-quick® PCR purification kits (Qiagen, cat* 
28104; Chatsworth, CA). Yield and purity of the PCR product determinec 
spectrophotometrically at OD 260 on a Beckman DU 650 spectrophotometer. 

2 - Dideoxy S equence Analy« si ? 

Fluorescent dye is attached to PCR products for automated sequencing using the Taq 
Dye Terminator® Kit (Perkin-Elmer cat# 401628). DNA sequencing is performed in 
both forward and reverse directions on an Applied Biosystems, Inc. (ABI) Foster City, 
CA., automated Model 377® sequencer. The software used for analysis of the resulting 
data is "Sequence Navigator® software" purchased through ABL The BRCAK°mi2) SE Q. 
ID. NO.: 3 sequence is entered into the Sequence Navigator® software as the Standard 
for comparison. The Sequence Navigator® software compares the sample sequence to 
the BRCAKoma) SEQ. ID. NO, 3 standard, base by base. The Sequence Navigator® 
software highlights all differences between the BRCAl^) SEQ.. ID. NO.: 3 DNA 
sequence and the patierif s sample sequence. 

A first technologist checks the computerized results by comparing visually the 
BRCAie-tf) SEQ. ID, NO, 3 standard against the patient's sample, and again highlights 
any differences between the standard and the sample. The first primary technologist 
then interprets the sequence variations at each position along the sequence. 
Chromatograms from each sequence variation are generated by the Sequence 
Navigator® software and printed on a color printer. The peaks are interpreted by the 
first primary technologist and also by a second primary technologist. A secondary 
technologist then reviews the chromatograms. The results are finally interpreted by a 
geneticist. In each instance, a variation is compared to known polymorphisms for 
position and base change. If the sample BRCA1 sequence matches the BRCAl^SEQ. 
ID. NO, 3 standard, with only variations within the known list of polymorphisms, it is 
interpreted as a gene sequence. 
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EXAMPLE 4 

DETERMINING THE PRESENCE OF A MUTATION IN THE BRCA1 GENF IISTNIf 
BRCAlimul AND SEVEN POLYMORPHISMS FOR REFERENCE 

A person skilled in the art of genetic susceptibility testing will find the present 
5 invention useful for determining the presence of a known or previously unknowr 
mutation in the BRCA1 gene. A list of mutations of BRCA1 is publicly available in the 
Breast Cancer Information Core at: 

r— - http://www.nchgr.nih.gov/dir/lab_transfer/bic. This data site became publicly 
available on November 1, 1995. Friend, S. et al Nature. Genetics 11:238, (1995). In this 

10 example, a mutation in exon 11 is characterized by amplifying the region of the 
mutation with a primer which matches the region of the mutation. 
Exon 11 of the BRCA1 gene is subjected to. direct dideoxy sequence analysis by 
asymmetric amplification using the polymerase chain reaction (PGR) to generate a 
single stranded product amplified from this DNA sample. Shuldiner, et al., Handbook 

15 of Techniques in Endocrine Research, p. 457-486, DePablo,R, Scanes, C, eds., Academic 
Press, Inc., 1993. Fluorescent dye is attached for automated sequencing using the Taq 
Dye Terminator® Kit (Perkin-Elmer cat# 401628). DNA sequencing is performed in 
both forward and reverse directions on an Applied Biosystems, Inc. (ABI) automated 
Model 377® sequencer. The software used for analysis of the resulting data is "Sequence 

20 Navigator® software" purchased through ABI. 

1. Polymerase Chain Reaction (PCR^ Amplifirarinn 

Genomic DNA (100 nanograms) extracted from white blood cells of the subject is 
amplified in a final volume of 25 microliters containing 1 microliter (100 nanograms) 
25 genomic DNA, 2.5 microliters 10X PCR buffer (100 mM Tris, pH 8.3, 500 mM KC1, 1.2 
mM MgCl 2 ), 2.5 microliters 10X dNTP mix (2 mM each nucleotide), 2.5 microliters 
forward primer (BRCA1-11K-F, 10 micromoiar solution), 2.5 microliters reverse primer 
(BRCA1-11K-R, 10 micromoiar solution),and 1 microliter Taq polymerase (5 units), and 
13 microliters of water. 

30 The PCR primers used to amplify segment K of exon 11 (where the mutation is found) 
are as follows: 
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BRCAM1K-F: 5'-GCA AAA GCG.TCC AGA AAG GA-3' SEQ ID NO:69 

BRCA1-11K-R: 5'-AGT CTT CCA ATT CAC TGC AC-3' SEQ ID NO:70 

The primers are synthesized on an DNA/RNA Model 394® Synthesizer. 

Thirty-five cycles are performed, each consisting of denaturing (95°C; 30 seconds), 

annealing (55°C; 1 minute), and extension (72°C; 90 seconds), except during the firs! 

cycle in which the denaturing time is increased to 5 minutes, and during the last cycle 

in which the extension time is increased to 5 minutes. 

PCR products are purified using Qia-quick® PCR purification kits (Qiagen, cat# 28104; 
Chatsworth, CA). Yield and purity of the. PCR product determined 
spectrophotometrically at OD 260 on a Beckman DU 650 spectrophotometer. 

2- Dideoxv Sequence Analysis 

Fluorescent dye is attached to PCR products for automated sequencing using the Taq 
Dye Terminator® Kit (Perkin-Elmer cat# 401628). DNA sequencing is performed in 
both forward and reverse directions on an Applied Biosystems, Inc. (ABI) Foster City, 
C A., automated Model 377® sequencer. The software used for analysis of the resulting 
data is "Sequence Navigator® software" purchased through ABI. The.BRCAl( omi2 ) SEQ. 
ID. NO.: 3 sequence is entered into the Sequence Navigator® software as the Standard 
for comparison. The Sequence Navigator® software compares the sample sequence to 
the BRCAK°mi2) SEQ. ID. NO.: 3 standard, base by base. The Sequence Navigator® 
software highlights all differences between the BRCAl( omi2 ) SEQ. ID. NO.: 3 DNA 
sequence and the patient's sample sequence. 

A first technologist checks the computerized results by comparing visually the 
BRCAl< omi2 ) SEQ. ID. NO.: 3 standard against the patient's sample, and again highlights 
any differences between the standard and the sample. The first primary technologist 
then interprets the sequence variations at each position along the sequence. 
Chromatograms from each sequence variation are generated by the Sequence 
Navigator® software and printed on a color printer. The peaks are interpreted by the 
first primary technologist and a second primary technologist A secondary technologist 
then reviews the chromatograms. The results are finally interpreted by a geneticist In 
each instance, a variation is compared to known polymorphisms for position, and base 
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change. Mutations are noted by the length of non-matching variation. Such a lengthy 
mismatch pattern occurs with deletions and substitutions. 

3. Result 

5 Using the above PGR amplification and standard fluorescent sequencing 

technology, The 3888delGA mutation may. be found. The 3888delGA mutation The 
3RCA1 gene lies in segment "K" of exon 11. The DNA sequence results demonstrate 
the presence of a two base pair deletion at nucleotides 3888 and 3889 of the published 
BRCAK omi ) sequence. This mutation interrupts the reading frame of the BRCA1 

10 transcript, resulting in the appearance of an in-frame terminator (TAG) at codon 
position 1265. This mutation is, therefore, predicted to result in a truncated, and most, 
likely, non-functional protein. The formal name of the mutation will be 3888delGA. 
This mutation is named in accordance with the suggested nomenclature for naming 
mutations, Baudet, A et al, Human Mutation 2:245-248, (1993). 

15 

EXAMPLES 

USE OF THE BRCAli flm m GENE THERAPY 

The growth of ovarian, breast or prostate cancer can be arrested by increasing the 
20 expression of the BRCA1 gene where inadequate expression of that gene is responsible 
for hereditary ovarian, breast and prostate cancer. It has been demonstrated that 
transfection of BRCA1 into cancer cells inhibits their growth and reduces 
tumorigenesis. Gene therapy is performed on a patient to reduce the size of a tumor. 
The LXSN vector is transformed with any of the BRCAK omil > SEQ. ID. NO.:l, 
25 BRCAK^) SEQ. ID. NO.:3, or BRCAK omi3 ) SEQ. ID. NO.:5 coding region. 

Vector 

The LXSN vector is transformed with wild type BRCAl< omil > SEQ. ID. NO.:l coding 
sequence. The LXSN-BRCAK omil ) retroviral expression vector is constructed by cloning 
30 a Srt/I-linkered BRCAl(° mil > cDNA (nucleotides 1-5711) into the Xhol site of the vector 
LXSN. Constructs are confirmed by DNA sequencing. Holt et al. Nature Genetics 12: 
298-302 (1996). 
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Retroviral vectors are manufactured from viral producer cells using serum free and 

phenol-red free conditions and tested for sterility, absence of specific pathogens, and 

absence of replication-competent retrovirus by standard assays. Retrovirus is stored 

frozen in aliquots which have been tested. 
5 Patients receive a complete physical exam, blood, and urine tests to determine 

overall health. They may also have a chest X-ray, electrocardiogram, and appropriate 

radiologic procedures to assess tumor stage. 

Patients with metastatic ovarian cancer are treated with retroviral gene therapy by 

infusion of recombinant LXSN-BRCA1(°™D retroviral vectors into peritoneal sites 
10 containing tumor, between 10 9 and 10™ viral particles per dose. Blood samples are 

drawn each day and tested for the presence of retroviral vector by sensitive polymerase 

chain reaction (PCR)-based assays. The fluid which is removed is analyzed to 

determine: 

15 1. The percentage of cancer cells which are taking up the recombinant LXSN- 
BRCAl(omii) retroviral vector combination. Successful transfer of BRCA1 gene into 
cancer cells is shown by both RT-PCR analysis and in situ hybridization. 
RT-PCR is performed with by the method of Thompson et al. Nature Genetics 9_: 444- 
450 (1995), using primers derived from BRCA1(°™1) SEQ. ID. NO.:l. Cell lysates are 
prepared and immunoblotting is performed by the method of Jensen et al. Nature 
Genetics 12: 303-308 1996) and Jensen et al. Biochemistry 21: 10887-10892 (1992). 
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2. Presence of programmed cell death using ApoTAG® in situ apoptosis detection kit 
(Oncor, Inc., Gaithersburg, Maryland) and DNA analysis. 

3. Measurement of BRCA I gene expression by slide immunofluorescence or western 
blot. 

Patients with measurable disease are also evaluated for a clinical response to LXSN- 
BRCAI, especially those that do not undergo a palliative intervention immediately after 
30 retroviral vector therapy. Fluid cytology, abdominal girth, CT scans of the abdomen, 
and local symptoms are followed. 
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For other sites of disease, conventional response criteria are used as follows: 

1. Complete Response (CR), complete disappearance of all measurable lesions 
and of all signs and symptoms of disease for at least 4 weeks. 

2. Partial Response (PR), decrease of at least 50% of the sum of the products of the 
5 2 largest perpendicular diameters of all measurable lesions as determined by 2 

observations not less than 4 weeks apart. To be considered a PR, no new lesions should 
have appeared during this period and none should have increased in size. 

3. Stable Disease, less than 25% change in tumor volume from previous 
evaluations. 

10 4. Progressive Disease, greater than 25% increase in tumor measurements from 
prior evaluations. 

The number of doses depends upon the response to treatment. 
For further information related to this gene therpay approach see in "BRCA1 
Retroviral Gene Therapy for Ovarian Cancer" a Human Gene Transfer Protocol: NIH 
15 ORD A Registration #: 9603-149 Jeffrey Holt, JT, M.D. and Carlos L. Arteaga, M.D. 
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"Breast and Ovarian cancer" is understood by those skilled in the art to 
include breast and ovarian cancer in women and also breast and prostate cancer in men. 
BRCA1 is associated genetic susceptibility to inherited breast and ovarian cancer in 
women and also breast and prostate cancer in men. Therefore, claims in this document 
which recite breast and/or ovarian cancer refer to breast, ovarian and prostate cancers in 
men and women. Although the invention has been described with reference to the 
presently preferred embodiments, it should be understood that various, modifications 
can be made without departing from the spirit of the Invention. Accordingly, the 
invention is limited only by the following claims. 
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